EBI Dbfetch

ID CHCLOSI standard; genomic DNA; PRO; 2768 BP. 
XX 

AC X63673; 
XX 

SV X63673.1
XX

DT 21-FEB-1992 (Rel. 31, Created) 

DT 19-JUL-1993 (Rel. 36, Last updated, Version 12) 
XX 

DE C.histolyticum closI gene for alpha-clostripain
XX

KW closI gene; Clostripain.
XX 

OS Clostridium histolyticum 

OC Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae;

OC Clostridium. 

XX 

RN [1] 

RP 1-2368 

RA Diefenthal T.;

RT 

RL Submitted (28-JAN-1992) to the EMBL/GenBank/DDBJ databases.

RL T. Diefenthal, Weissheimer Research, Dept. Biotechnology, Schaarstr.1,

RL P.O.B. 20 .65, W-5470 Andernach, FRG 

XX 

RN [2] 

RP 1-2368 

RX DOI; 10.1007/BF00276893 

RX MEDLINE; 93341452.

RX PUBMED; 8341259.

RA Dargatz H., Diefenthal T., Witte V., Reipen G., von Wettstein D.; 

RT "The heterodimeric protease clostripain from Clostridium histolyticum is 

RT encoded by a single gene"; 

RL Mol. Gen. Genet. 240(1):140-145(1993).



XX 

FH Key Location/Qualifiers 
FH 

FT source 1..2768 

FT /db_xref="taxon:1498"

FT /mol_type="genomic DNA"

FT /organism="Clostridium histolyticum" 

FT -35_signal 376..381

FT -10_signal 397..402

FT repeat_region 420..438

FT /note="palindrom-structure"

FT /rpt_type=INVERTED

FT RBS 605..609

FT CDS 618..2198

FT /db_xref="GOA:P09870"

FT /db_xref="UniProt/Swiss-Prot:P09870"

FT /transl_table=11

FT /gene="ClosI" 

FT /EC_number="3.4.22.8"
FT /product="clostripain"

FT /protein_id="CAA45212.1"

FT /translation="MLRRKVSTLLMTALITTSFLNSKPVYANPVTKSKDNNLKEVQQVT

FT SKSNKNKNQKVTIMYYCDADNNLEGSLLNDIEEMKTGYKDSPNLNLIALVDRSPRYSSD

FT EKVLGEDFSDTRLYKIEHNKANRLDGKNEFPEISTTSKYEANMGDPEVLKKFIDYCKSN 

FT YEADKYVLIMANHGGGAREKSNPRLNRAICWDDSNLDKNGEADCLYMGEISDHLTEKQS

FT VDLLAFDACLMGTAEVAYQYRPGNGGFSADTLVASSPVVWGPGFKYDKIFDRIKAGGGT

FT NNEDDLTLGGKEQNFDPATITNEQLGALFVEEQRDSTHANGRYDQHLSFYDLKKAESVK 

FT RAIDNLAVNLSNENKKSEIEKLRGSGIHTDLMHYFDEYSEGEWVEYPYFDVYDLCEKIN 

FT KSENFSSKTKDLASNAMNKLNEMIVYSFGDPSNNFKEGKNGLSIFLPNGDKKYSTYYTS

FT TKIPHWTMQSWYNSIDTVKYGLNPYGKLSWCKDGQDPEINKVGNWFELLDSWFDKTNDV 

FT TGGVNHYQW" 
XX 

SQ Sequence 2768 BP; 1084 A; 359 C; 427 G; 898 T; 0 other; 



gctataatca atgccatata tataaagatt aaagtaatct tttgtatcat ctaataatgg 60 

agtatcagct ttattaatta attgcttttt taaaaaatta tttgtaagtt agtatactct 120 

tctactaaat agttaaattc agcttttcat ttttccatca ctaccataat tacgtaatca 180 

ttatatcata gaatgctttt accacgtaat ataaaaagca agagtcctat attaaatcct 240 

tgaacctaaa gacctaatgt agatttttaa caaaacaaat gattgatcac ttaatatatc 300 

taacagttca gataacttcc tttttgagat tgtaatatat aagtaaattc aaagaccata 360 

aaatataccc cccgattgta atgcttgaat attaattata atgaaattat agtttaaatt 420 

aaaaaaataa tttttttatg aagtttaaga aggattatcc taaacattat agaatatatt 480 

aaatataaaa ataatgtaaa actttaaata ataactctta taatggtaca taatgtattt 540 

atatggatta aattttaatt tatagatgct ataaaactat aaattaaatt taaaatatac 600 

ttttggaggt atgtattatg ttaagaagaa aagtatcaac actattaatg acagctttga 660 

taactacttc atttttaaat tccaaacccg tatatgcaaa tccagtaact aaatccaagg 720 

ataataactt aaaagaagta caacaagtta caagcaagag taataaaaac aaaaatcaaa 780 

aagtaactat tatgtactat tgcgacgcag acaataactt ggaaggaagt ctattaaatg 840 

atatcgagga aatgaaaaca ggatataagg atagtcctaa tttaaattta attgctcttg 900 

tagacagatc cccaagatat agcagtgacg aaaaagtttt aggtgaagat tttagtgata 960 

cacgtcttta taagattgaa cacaataagg caaatagatt agacggtaaa aatgaatttc 1020 

cagaaataag tactactagt aaatatgaag ctaacatggg ggatcctgaa gttcttaaaa 1080 

aatttattga ttattgtaaa tctaattatg aggctgataa atatgtgctt ataatggcta 1140 

atcatggtgg tggtgcaagg gaaaaatcaa atccaagatt aaatagagca atttgctggg 1200 

atgatagtaa ccttgataaa aatggtgaag cagactgcct ttatatgggt gaaatttcag 1260

atcatttaac agaaaaacaa tcagttgatt tacttgcctt tgatgcatgc cttatgggaa 1320 

ctgcagaagt agcgtatcag tatagaccag gtaatggagg attttctgcc gatactttag 1380 

ttgcttcaag cccagtagtt tggggtcctg gattcaaata tgataagatt ttcgatagga 1440 

taaaagctgg tggaggaact aataatgagg atgatttaac tttaggtggt aaagaacaaa 1500 

actttgatcc tgcaaccatt accaatgagc aattaggtgc attatttgta gaagagcaaa 1560

gagactcaac acatgccaat ggtcgctatg atcaacactt aagcttttat gatttaaaga 1620 

aagctgaatc agtaaaaaga gccatagata atttagctgt taatctaagt aatgaaaaca 1680 

aaaaatctga aattgaaaaa ttaagaggaa gtggaattca tacagattta atgcattact 1740 

tcgatgaata ttctgaagga gaatgggttg aatatcctta ttttgacgtg tatgatttat 1800 

gtgaaaaaat aaataaaagt gaaaatttta gtagtaaaac taaagattta gcttcaaatg 1860

ctatgaataa attaaatgaa atgatagttt attcttttgg agaccctagt aataatttta 1920 

aagaaggaaa aaatggattg agtatattct tacctaatgg agataaaaaa tattcaactt 1980 

attatacatc aaccaagata cctcattgga ctatgcaaag ttggtataat tcaatagata 2040

cagttaaata tggattgaat ccttacggaa aattaagttg gtgtaaagat ggacaagatc 2100 

ctgaaataaa taaagttgga aattggtttg aacttctaga ttcttggttt gataaaacta 2160 

atgatgtaac tggaggagtt aatcattacc aatggtaaaa ctcaatatta ctataattga 2220 

attgtaattt ataaaattta tattcaacta aaaacttctg cctcaatcta gaaaattgcc 2280 

tttctaaatt gaggcaatat tttaataatt cattccttta tttcaactat tcttcttatg 2340 

aaactgcttt ttgtctagat aaacaataga tgctccaaca attagtattc ctcctataat 2400 

agaatataca gataaagctt ctccaaataa aataaatccc caagcggatt aaacaatatt 2460 

cctatatatt acaacttcta caattactgc attagcattt ccgaaagcac ctgttaagaa 2520 

tatttgtgct aataaggaaa cggcacctaa caagattaac aggattaatt cttttgctgt 2580 

tggtattaca aagttatttc tcatcaatac tccagcaact atagtagatg cagctaagaa 2640 

ataaaacact atttcgtaat gatgatgttt cttagacttt gctaattttc ttatggtaat 2700 

tgaagcactt gccgataaaa ctgcacttaa catccaaaga tggcatagaa agaataagta 2760 

gagtatct 2768 

// 

Entry information 



Entry name                          CLOS_CLOHI
Primary accession number            P09870
Secondary accession number          P09869
Entered in Swiss-Prot in            Release 10, March 1989
Sequence was last modified in       Release 26, July 1993
Annotations were last modified in   Release 45, October 2004

Name and origin of the protein

Protein name                        Alpha-clostripain [Precursor]
Synonyms                            EC 3.4.22.8
                                    Clostridiopeptidase B
Gene name                           Name: cloSI
From                                Clostridium histolyticum [TaxID: 1498]
Taxonomy                            Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae;
                                    Clostridium.

References 

[1] NUCLEOTIDE SEQUENCE. 

MEDLINE=93341452;PubMed=8341259 [NCBI, ExPASy, EBI, Israel, Japan] 
Dargatz H., Diefenthal T., Witte V., Reipen G., von Wettstein D.; 

"The heterodimeric protease clostripain from Clostridium histolyticum is encoded by a single 
gene."; 

Mol. Gen. Genet. 240: 140-145(1993). 
[2] PROTEIN SEQUENCE OF 51-181.

MEDLINE=85076641;PubMed=6391922 [NCBI, ExPASy, EBI, Israel, Japan] 

Gilles A.M., Lecroisey A., Keil B.; 

"Primary structure of alpha-clostripain light chain."; 

Eur. J. Biochem. 145:469-476(1984). 

[3] PRELIMINARY PROTEIN SEQUENCE OF 51-73 AND 191-232.

MEDLINE=83131688;PubMed=6337850 [NCBI, ExPASy, EBI, Israel, Japan] 
Gilles A.M., de Wolf A., Keil B.;
"Amino-acid sequences of the active-site sulfhydryl peptide and other thiol peptides from the
cysteine proteinase alpha-clostripain."; 
Eur. J. Biochem. 130:473-479(1983). 

Comments 

• FUNCTION: Cysteine endopeptidase with strict specificity. 

• CATALYTIC ACTIVITY: Preferential cleavage: Arg-|-Xaa, including Arg-|-Pro bond, but not 
Lys-|-Xaa. 

• SUBUNIT: Heterodimer of a light chain and a heavy chain held together by strong noncovalent
forces rather than by intramolecular disulfide bridges.

• SIMILARITY: Belongs to the peptidase C11 family.

• DATABASE NAME=Worthington enzyme manual; WWW="http://www.worthington-biochem.com/CP/".

Copyright 

This Swiss-Prot entry is copyright. It is produced through a collaboration between the Swiss Institute of Bioinformatics and
the EMBL outstation - the European Bioinformatics Institute. There are no restrictions on its use as long as its content is in 
no way modified and this statement is not removed. 

Cross-references 

EMBL            X63673; CAA45212.1; -. [EMBL / GenBank / DDBJ] [CoDing Sequence]
PIR             A29174; A29174.
                A29175; A29175.
                B29175; B29175.
                S35190; S35190.
MEROPS          C11.001; -.
InterPro        IPR005077; Peptidase_C11.
Pfam            PF03415; Peptidase_C11; 1.
ProDom          [Domain structure / List of seq. sharing at least 1 domain]
HOBACGEN        [Family / Alignment / Tree]
BLOCKS          P09870.
ProtoNet        P09870.
ProtoMap        P09870.
PRESAGE         P09870.
DIP             P09870.
ModBase         P09870.
SMR             P09870; E151372FF6C95BE7.
SWISS-2DPAGE    Get region on 2D PAGE.
UniRef          View cluster of proteins with at least 50% / 90% identity.
Keywords 

Direct protein sequencing; Hydrolase; Signal; Thiol protease; Zymogen. 
Features 



Key        From          To            Length  Description
SIGNAL     1             27            27      Potential.
CONFLICT   [illegible]   [illegible]           K -> N^L (in Ref. 2).
CONFLICT   176           179                   HGGG -> GDGn (in Ref. 2).
CONFLICT   197           197                   S -> H (in Ref. 3).
CONFLICT   213           213                   I -> L (in Ref. 3).
CONFLICT   216           216                   H -> T (in Ref. 3).
CONFLICT   232           232                   L -> M (in Ref. 3).





Sequence information 

Length: 526 AA [This is the length of the unprocessed precursor]
Molecular weight: 59733 Da [This is the MW of the unprocessed precursor]
CRC64: E151372FF6C95BE7 [This is a checksum on the sequence]

        10         20         30         40         50         60
MLRRKVSTLL MTALITTSFL NSKPVYANPV TKSKDNNLKE VQQVTSKSNK NKNQKVTIMY

        70         80         90        100        110        120
YCDADNNLEG SLLNDIEEMK TGYKDSPNLN LIALVDRSPR YSSDEKVLGE DFSDTRLYKI

       130        140        150        160        170        180
EHNKANRLDG KNEFPEISTT SKYEANMGDP EVLKKFIDYC KSNYEADKYV LIMANHGGGA

       190        200        210        220        230        240
REKSNPRLNR AICWDDSNLD KNGEADCLYM GEISDHLTEK QSVDLLAFDA CLMGTAEVAY

       250        260        270        280        290        300
QYRPGNGGFS ADTLVASSPV VWGPGFKYDK IFDRIKAGGG TNNEDDLTLG GKEQNFDPAT

       310        320        330        340        350        360
ITNEQLGALF VEEQRDSTHA NGRYDQHLSF YDLKKAESVK RAIDNLAVNL SNENKKSEIE

       370        380        390        400        410        420
KLRGSGIHTD LMHYFDEYSE GEWVEYPYFD VYDLCEKINK SENFSSKTKD LASNAMNKLN

       430        440        450        460        470        480
EMIVYSFGDP SNNFKEGKNG LSIFLPNGDK KYSTYYTSTK IPHWTMQSWY NSIDTVKYGL

       490        500        510        520
NPYGKLSWCK DGQDPEINKV GNWFELLDSW FDKTNDVTGG VNHYQW



*File 399: Use is subject to the terms of your user/customer agreement.
Alert feature enhanced for multiple files, etc. See HELP ALERT.
File 434:SciSearch(R) Cited Ref Sci 1974-1989/Dec
(c) 1998 Inst for Sci Info
*File 434: Price change effective Jan 1, 2005. Enter HELP
RATES 434 for details.

File 444:New England Journal of Med. 1985-2005/Feb W1

(c) 2005 Mass. Med. Soc.
File 467:ExtraMED(tm) 2000/Dec
(c) 2001 Informania Ltd.
*File 467: F467 no longer updates; see Help News467.

Set Items Description 



Cost is in DialUnits 
?ds 

Set Items Description 

S1 16 AU=DARGATZ ? AND PY=1993

S2 4 S1 AND HISTOLYTICUM?
?t S2/9/1 

2/9/1 (Item 1 from file: 155) 

DIALOG(R)File 155:MEDLINE(R)

(c) format only 2005 The Dialog Corp. All rts. reserv.

09780560 PMID: 8341259 

The heterodimeric protease clostripain from Clostridium histolyticum is 
encoded by a single gene. 

Dargatz H; Diefenthal T; Witte V; Reipen G; von Wettstein D

Weissheimer Research Laboratory, Andernach, Germany. 

Molecular & general genetics - MGG (GERMANY) Jul 1993, 240 (1)
p140-5, ISSN 0026-8925 Journal Code: 0125036
Document type: Journal Article 
Languages: ENGLISH 
Main Citation Owner: NLM 
Record type: Completed 
Subfile: INDEX MEDICUS 

Clostripain (EC 3.4.22.8) is a heterodimeric cysteine endopeptidase with 
strict specificity for Arg-Xaa peptidyl bonds. It is secreted by 
Clostridium histolyticum strains. For the first time we present evidence 
that both polypeptide chains of native clostripain are encoded by a single 
gene. DNA sequencing of two overlapping genomic DNA fragments revealed a 
single open reading frame (ORF) of 1581 nucleotides encoding a polypeptide
of 526 amino acid residues. The ORF is preceded by canonical transcription 
signals and both chains of the clostripain heterodimer are completely 
represented by the deduced coding sequence. Most interestingly, the 
sequences coding for the light and the heavy chain are joined by a DNA 
stretch coding for a linker nonapeptide that is preceded by the C-terminal 
arginyl residue of the light chain and also ends with an arginyl residue. 
Heterologous expression of the gene in Escherichia coli yielded an enzyme 
capable of hydrolyzing the clostripain substrates N
alpha-benzoyl-L-arginine ethyl ester (BAEE) and N-carbobenzoxy-L-arginine
p-nitroanilide (Z-Arg-pNA).

Tags: Support, Non-U.S. Gov't

Descriptors: *Clostridium--enzymology--EN; *Cysteine Endopeptidases
--genetics--GE; *DNA, Bacterial--genetics--GE; *Genes, Bacterial--genetics
--GE; Amino Acid Sequence; Base Sequence; Clostridium--genetics--GE;
Cysteine Endopeptidases--isolation and purification--IP; Escherichia coli;
Gene Expression--genetics--GE; Molecular Sequence Data; Protein Sorting
Signals--genetics--GE; Transcription, Genetic--genetics--GE

Molecular Sequence Databank No.: GENBANK/X63673 

CAS Registry No.: 0 (DNA, Bacterial); 0 (Protein Sorting Signals) 
Enzyme No.: EC 3.4.22 (Cysteine Endopeptidases); EC 3.4.22.8
(clostripain)

Record Date Created: 19930902 
Record Date Completed: 19930902 
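
The abstract's figures (an ORF of 1581 nucleotides encoding 526 amino acid residues) can be cross-checked against the CDS location 618..2198 given in the EMBL entry X63673. The sketch below is illustrative only: the function name cds_stats is hypothetical, and it assumes the CDS span is 1-based, inclusive, and contains exactly one terminal stop codon.

```python
def cds_stats(start, end):
    """Return (nucleotides, residues) for a CDS with 1-based inclusive coordinates.

    Assumption: the span ends with a single stop codon, which is not
    counted as a residue.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nucleotides // 3
    return nucleotides, codons - 1  # drop the stop codon


# CDS 618..2198 from the feature table of X63673
nt, aa = cds_stats(618, 2198)
print(nt, aa)  # 1581 526
```

Both values agree with the abstract and with the 526 AA precursor length reported in the Swiss-Prot entry P09870.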



